A Hierarchical IRT Model for Criterion-Referenced Measurement

نویسندگان

  • Rianne Janssen
  • Francis Tuerlinckx
  • Michel Meulders
  • Paul De Boeck
چکیده

A hierarchical IRT model is proposed for mastery classification in criterionreferenced measurement. In this model, items measuring the same criterion are grouped, and a difficulty and discrimination parameter of the criterion is estimated on the same scale as the person and item parameters. The level of proficiency of a student with respect to the criterion is determined by the probability of success on the criterion. Cutoff points on the probability scale can be used to classify respondents into masters and nonmasters. The hierarchical IRT model is estimated using the Gibbs sampler and tested using posterior predictive checks. The model is illustrated with a test measuring the attainment targets of reading comprehension (in Dutch) at the end of primary education.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Marginal True-Score Measures and Reliability for Binary Items as a Function of Their IRT Parameters

This article provides analytic evaluations of population true-score measures for binary items given their item response theory (IRT) calibration. Under the assumption of normal trait distribution, the expected values of marginalized true scores, error variance, true score variance, and reliability for norm-referenced and criterion-referenced interpretations are presented as a function of the it...

متن کامل

Measurement Error in Hierarchical Gain Score Modeling

This paper compares three approaches for solving the problem of measurement error in a hierarchical gain score model. The pre-test and post-test scores are IRT scores with measurement error. Explanatory variables at student level and class level are considered in the model. Simulation results show that the gain score model that does not consider measurement error overestimates the explanatory v...

متن کامل

MEASURING AND DETECTING DIFFERENTIAL ITEM FUNCTIONING IN CRITERION-REFERENCED LICENSING TEST A Theoretic Comparison of Methods

The validity of a measurement instrument depends on the quality of the items included in the instrument. The overall aim was to compare methods for detecting and measuring differential item functioning, DIF, in order to find a suitable method for examining DIF in a dichotomously scored criterion-referenced licensing test. The methods were discussed with respect to whether they are parametric, t...

متن کامل

Multilevel IRT Modeling in Practice with the Package mlirt

Variance component models are generally accepted for the analysis of hierarchical structured data. A shortcoming is that outcome variables are still treated as measured without an error. Unreliable variables produce biases in the estimates of the other model parameters. The variability of the relationships across groups and the group-effects on individuals’ outcomes differ substantially when ta...

متن کامل

Bridging the semantic gap for software effort estimation by hierarchical feature selection techniques

Software project management is one of the significant activates in the software development process. Software Development Effort Estimation (SDEE) is a challenging task in the software project management. SDEE is an old activity in computer industry from 1940s and has been reviewed several times. A SDEE model is appropriate if it provides the accuracy and confidence simultaneously before softwa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000